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SEQUENCE LISTING 



c--> 


5 


(1) GENERAL INFORMATION: 




7 


(i) 


APPLICANT: Hillman, Jennifer L. 




8 




Goli, Surya K. 


c--> 


10 


(ii) 


TITLE OF INVENTION: NOVEL HUMAN MLS3 PROTEIN 




13 


(iii) 


NUMBER OF SEQUENCES : 6 




15 


(iv) 


CORRESPONDENCE ADDRESS: 




16 




(A) ADDRESSEE: Incyte Pharmaceuticals/ Inc. 




17 




(B) STREET: 3174 Porter Drive 




18 




(C) CITY: Palo Alto 




19 




(D) STATE: CA 




20 




(E) COUNTRY: USA 




21 




(F) ZIP: 94304 Ci 




23 


(V) 


COMPUTER READABLE FORM: 




24 




(A) MEDIUM TYPE: Diskette 




25 




(B) COMPUTER: IBM Compatible 




26 




(C) OPERATING SYSTEM: DOS 




27 




(D) SOFTWARE: FastSEQ for Windows Version 2 




34 


(vi) 


CURRENT APPLICATION DATA: 


c--> 


35 




(A) APPLICATION NUMBER: US/10/023,929 


c--> 


36 




(B) FILING DATE: 17-Dec-2001 




37 




( C) CLASSIFICATION : 




40 


(vii) 


PRIOR APPLICATION DATA: 




41 




(A) APPLICATION NUMBER: 08/805,965 




42 




(B) FILING DATE: 




44 


(viii) 


ATTORNEY/AGENT INFORMATION: 




45 




(A) NAME: Billings, Lucy J. 




46 




(B) REGISTRATION NUMBER: 36,749 




47 




(C) REFERENCE/DOCKET NUMBER: PF-0223 US 




49 


(ix) 


TELECOMMUNICATION INFORMATION: 




50 




(A) TELEPHONE: 415-855-0555 




51 




(B) TELEFAX: 415-845-4166 




52 




(C) TELEX: 




55 


(2) INFORMATION FOR SEQ ID NO : 1: 




57 


(i) 


SEQUENCE CHARACTERISTICS: 




58 




(A) LENGTH: 262 amino acids 




59 




(B) TYPE: amino acid 




60 




(C) STRANDEDNESS : single 




61 




(D) TOPOLOGY: linear 




63 


(vii) 


IMMEDIATE SOURCE: 




64 




(A) LIBRARY: BRAITUT02 




65 




(B) CLONE: 762280 




67 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 1: 
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i It: u 


It liC 


Arg 




ne L. 
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Asp 
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Glu 


Pro 


Glu 


Asp 


Pro 


Met 


Phe 


Leu 


10 
f \j 
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10 










15 




71 


Met 


Asp 


Pro 


fne 


Ala 


lie 


His 


Arg 


Gin 


His 


Met 


Ser 


Arg 


Met 


Leu 


Ser 


7 9 








20 










Z D 










30 






7 '\ 


Gly 


Gly 


Frie 


Gly 


Tyr 


Ser 


Pro 


Phe 


Leu 
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He 


Thr 
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Met 


74 
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7 ^ 
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Gly 


J. fir 
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Arg 


Arg 
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Ala 


Gly Ala 


Val 


76 




50 










DO 










6 0 










/ / 


Xaa 


Pro 


File 


Gly 


Xaa 


Leu 


Gly 


Met 


Ser 


Gly 


Gly 


Phe 


Met 


Asp 


Met 


Phe 


7ft 


65 










/ U 










75 








80 




Gly 


Met 


Met 


Asn 


Asp 


Met 


Xaa 


Gly 


Asn 


Met 


Glu 


His 


Met 


Thr 


Ala 


Gly 


ft 0 










85 










90 










95 


ft 1 
O J. 
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Ser 


Ser 
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He 
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Tyr 


Ser 
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ft 9 








100 
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ft 
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Val 
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Glu 
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Glu 


Met 


Arg 


ftA 
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ft R 
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Arg 
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Ser 


ft ^ 
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Leu 
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Cjin 
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He 
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He 
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88 
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1 
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1 DO 










150 


89 
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His 


He 
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Arg 
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Arg 
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90 
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175 




91 
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Leu 
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92 








180 
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93 
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Trp 
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Gin 
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94 






195 










200 










205 








95 
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Phe 


Arg 


Arg 


Leu 


Glu 
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Gly 
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Gly 


Gly 


Arg 


Arg 


Ala 


96 
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220 










97 


Glu 


Gly 


Pro 


Pro 


Arg 


Leu 


Ala 


He 


Gin 


Gly 


Pro 


Glu 


Asp 


Ser 


Leu 


Pro 


98 


225 










230 










235 










240 


99 


Asp 


Ser 


Pro 


Ala 


Ala 


Met 


Thr 


Gly Glu 


Gly 


Pro 


Gly 


Ala 


Ser 


Ala 


Leu 



100 245 250 255 

101 Leu Tyr Arg Leu Arg Gly 

102 260 

104 (2) INFORMATION FOR SEQ ID NO: 2: 

106 (i) SEQUENCE CHARACTERISTICS: 

107 (A) LENGTH: 1322 base pairs 

108 (B) TYPE: nucleic acid 

109 (C) STRANDEDNESS : single 

110 (D) TOPOLOGY: linear 

112 (vii) IMMEDIATE SOURCE: 

113 (A) LIBRARY: BRAITUT02 

114 (B) CLONE: 762280 

116 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

118 GGGGGGCGTA CGGAGGTGGC AGCTGTGGGA GGAGGCGGCG TGGAAGGCCG AGGAGCTCAA 60 

119 GCCCGGACCA ATCCCCACGT TCCGGGCCGC CACCCTGACC CTGCAGCGTA CCGGGAAGCG 120 
12 0 AAACCGGCCG GATGGGCCGC TGAGCCCGAA TCGGGCACTG TGTGGAGCCC CCTGGAGCTG 180 

121 AGATCAGGAT GTTCCGCTTC ATGAGGGACG TGGAGCCTGA GGATCCCATG TTCCTGATGG 24 0 

122 ATCCCTTTGC TATTCACCGT CAGCATATGA GCCGTATGTT GTCAGGTGGC TTTGGATATA 300 
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12 3 GCCCCTTCCT CAGCATCACA GATGGCAACA TGCCAGGGAC CAGGGCTGCC AGCCGCCGGA 360 

124 TGCAGCAGGC TGGAGCTGTC TNCCCCTTTG GGNTGCTGGG AATGTCGGGT GGTTTCATGG 4 20 

12 5 ACATGTTTGG GATGATGAAT GACATGNTTG GAAACATGGA ACACATGACA GCTGGAGGCA 4 80 

12 6 ATTGCCAGAC CTTCTCATCT TCCACTGTCA TCTCCTACTC CAATACGGGT GATGGTGCCC 54 0 

127 CCAAGGTCTA CCAAGAGACA TCAGAGATGC GCTCGGCACC AGGCGGGATC CGGGAGACAC 600 

128 GGAGGACTGT TCGGGATTCA GACAGTGGAC TGGAGCAGAT GTCCATTGGG CATCACATCC 660 

129 GGGACAGGGC TCACATCCTC CAGCGCTCCC GAAACCATCG CACGGGGGAC CAGGAGGAGC 720 

130 GGCAGGACTA TATCAACCTG GATGAGAGTG AGGCCGCAGC GTTTGATGAC GAGTGGCGGC 780 

131 GGGAGACCTC CCGATTCCGG CAGCAGCGTC CCCTGGAGTT TCGGCGGCTT GAGTCCTCAG 840 

132 GGGCTGGGGG ACGAAGGGCG GAGGGGCCTC CCCGCCTGGC CATCCAGGGA CCTGAGGACT 900 

133 CCCTTCCCGA CAGTCCCGCC GCTATGACTG GTGAGGGCCC CGGGGCCTCA GCTCTCTTGT 96 0 

134 ACAGGCTGAG AGGCTGAGAA ATCATCCCCT GAATAACTTT TTCCTCTCGA TTCCCATCCC 1020 

135 CAATTTAATA TTAAATTAAC AGGCAAGCCG GCCCCCACCT CTCCCTGGGG GTCTCAGGGA 1080 

136 GAACCTTTCA CGGCACCCTT TCCCTACCTT TTCCTTCTTT AATCTCCTGG TTTACCATTG 114 0 

137 ATGACTTCGG CTCTGCATCT ACTTACTTGA TTTTTCATTC TGCCACTTCA TCTTCAAACC 12 00 

138 CCCTCACCTT TCCCATCCTA CTCCTGCCAT GCATTGAAGG GTCAATGCAT TTTGGGGTGA 1260 

139 GNTTNGGTTT AGGGGCCCCT TCATNCCTNA GCTACCTGGG TCTTTGCCCA ACTTTTCTCA 132 0 

140 GA 1322 
142 (2) INFORMATION FOR SEQ ID NO: 3: 

144 (i) SEQUENCE CHARACTERISTICS: 

145 (A) LENGTH: 268 amino acids 

146 (B) TYPE: amino acid 

147 (C) STRANDEDNESS: single 

148 (D) TOPOLOGY: linear 

150 (vii) IMMEDIATE SOURCE: 

151 (A) LIBRARY: GenBank 

152 (B) CLONE: 1066392 

154 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

156 Met Phe Arg Met Leu Asn Ser Ser Phe Glu Asp Asp Pro Phe Phe Ser 

157 15 10 15 

158 Glu Ser lie Leu Ala His Arg Glu Asn Met Arg Gin Met lie Arg Ser 

159 20 25 30 

160 Phe Ser Glu Pro Phe Gly Arg Asp Leu Leu Ser He Ser Asp Gly Arg 

161 35 40 45 

162 Gly Arg Ala His Asn Arg Arg Gly His Asn Asp Gly Glu Asp Ser Leu 

163 50 55 60 

164 Thr His Thr Asp Val Ser Ser Phe Gin Thr Met Asp Gin Met Val Ser 

165 65 70 75 80 

166 Asn Met Arg Asn Tyr Met Gin Lys Leu Glu Arg Asn Phe Gly Gin Leu 

167 85 90 95 

16 8 Ser Val Asp Pro Asn Gly His Ser Phe Cys Ser Ser Ser Val Met Thr 

169 100 105 110 

170 Tyr Ser Lys He Gly Asp Glu Pro Pro Lys Val Phe Gin Ala Ser Thr 

171 115 120 125 

172 Gin Thr Arg Arg Ala Pro Gly Gly He Lys Glu Thr Arg Lys Ala Met 

173 130 135 140 

174 Arg Asp Ser Asp Ser Gly Leu Glu Lys Met Ala He Gly His His He 

175 145 150 155 160 

176 His Asp Arg Ala His Val He Lys Lys Ser Lys Asn Lys Lys Thr Gly 
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177 165 170 175 

178 Asp Glu Glu Val Asn Gin Glu Phe lie Asn Met Asn Glu Ser Asp Ala 

179 180 185 190 

180 His Ala Phe Asp Glu Glu Trp Gin Ser Glu Val Leu Lys Tyr Lys Pro 

181 195 200 205 

182 Gly Arg His Asn Leu Gly Asn Thr Arg Met Arg Ser Val Gly His Glu 

183 210 215 220 

184 Asn Pro Gly Ser Arg Glu Leu Lys Arg Arg Glu Lys Pro Gin Gin Ser 

185 225 230 235 240 

186 Pro Ala lie Glu His Gly Arg Arg Ser Asn Val Leu Gly Asp Lys Leu 

187 245 250 255 

188 His lie Lys Gly Ser Ser Val Lys Ser Asn Lys Lys 

189 260 265 
191 (2) INFORMATION FOR SEQ ID NO: 4: 

193 (i) SEQUENCE CHARACTERISTICS: 

194 (A) LENGTH: 1116 base pairs 

195 (B) TYPE: nucleic acid 

196 (C) STRANDEDNESS : single 

197 (D) TOPOLOGY: linear 

199 (vii) IMMEDIATE SOURCE: 

200 (A) LIBRARY: GenBank 

201 (B) CLONE: 1066391 

203 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

205 GTTATGTGTT CCCGTCCGTA CTGGAGGCTA GCTCTTGTCG CGGCCGCGGC GAGTTAACAT 6 0 

206 CGTTTTTCCA ATCTGTCCGC GGCTGCCGCC ACCCAAGACA GAGCCAGAAT GTTCAGGATG 120 

207 CTGAACAGCA GTTTTGAGGA TGACCCCTTC TTCTCTGAGT CCATTCTTGC ACACCGAGAA 180 

208 AATATGCGAC AGATGATAAG AAGTTTTTCT GAACCCTTTG GAAGAGACTT GCTCAGTATC 24 0 

209 TCTGATGGTA GAGGGAGAGC TCATAATCGT AGAGGACATA ATGATGGTGA AGATTCTTTG 3 00 

210 ACTCATACAG ATGTCAGCTC TTTCCAGACC ATGGACCAAA TGGTGTCAAA TATGAGAAAC 36 0 

211 TATATGCAGA AATTAGAAAG AAACTTCGGT CAACTTTCAG TGGATCCAAA TGGACATTCA 420 

212 TTTTGTTCTT CCTCAGTTAT GACTTATTCC AAAATAGGAG ATGAACCGCC AAAGGTTTTT 4 80 

213 CAGGCCTCAA CTCAAACTCG TCGAGCTCCA GGAGGAATAA AGGAAACCAG GAAAGCAATG 540 

214 AGAGATTCTG ACAGTGGACT AGAAAAAATG GCTATTGGTC ATCATATCCA TGACCGAGCT 600 

215 CATGTCATTA AAAAGTCAAA GAACAAGAAG ACTGGAGATG AAGAGGTCAA CCAGGAGTTC 660 

216 ATCAATATGA ATGAAAGCGA TGCTCATGCT TTTGATGAGG AGTGGCAAAG TGAGGTTTTG 72 0 

217 AAGTACAAAC CAGGACGACA CAATCTAGGA AACACTAGAA TGAGAAGTGT TGGCCATGAG 780 

218 AATCCTGGCT CCCGAGAACT TAAAAGAAGG GAGAAACCTC AACAAAGTCC AGCCATTGAA 84 0 

219 CATGGAAGGA GATCAAATGT TTTGGGGGAC AAACTCCACA TCAAAGGCTC ATCTGTGAAA 900 

220 AGCAACAAAA AATAAATAGC CATGCATTTG ATTTGTTTAG TTTTGATTGT TTTAACAGTT 96 0 

221 AGTAATGGTG CTGGGTAATA AGCATAAGAC CAATCTCTTG CTGTTAAATC AGTTCTGTCC 102 0 
22 2 TTGGCAACTT TCTTCTGATA TCTGAATGTT CATGAAGGTC CTAGCTTTAT ATTGTCCCTC 1080 
223 TTTTAGGAAT AAAATTTTGA TTTTCAACAA AAAAAA 1116 
22 5 (2) INFORMATION FOR SEQ ID NO : 5: 

227 (i) SEQUENCE CHARACTERISTICS: 

228 (A) LENGTH: 248 amino acids 

229 (B) TYPE: amino acid 

230 (C) STRANDEDNESS: single 
2 31 (D) TOPOLOGY: linear 
233 (vii) IMMEDIATE SOURCE: 
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234 (A) LIBRARY: GenBank 

235 (B) CLONE: 1399745 

237 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

239 Met Phe Arg Phe Met Arg Asp Val Glu Pro Glu Asp Pro Met Phe Leu 

240 15 10 15 

241 Met Asp Pro Phe Ala He His Arg Gin His Met Ser Arg Met Leu Ser 

242 20 25 30 

24 3 Gly Gly Phe Gly Tyr Ser Pro Phe Leu Ser He Thr Asp Gly Asn Met 

244 35 40 45 

24 5 Pro Gly Thr Arg Pro Ala Ser Arg Arg Met Gin Gin Ala Gly Ala Val 

246 50 55 60 

24 7 Ser Pro Phe Gly Met Leu Gly Met Ser Gly Gly Phe Met Asp Met Phe 

248 65 70 75 80 

24 9 Gly Met Met Asn Asp Met He Gly Asn Met Glu His Met Thr Ala Gly 

250 85 90 95 

2 51 Gly Asn Cys Gin Thr Phe Ser Ser Ser Thr Val He Ser Tyr Ser Asn 

252 100 105 110 

253 Thr Gly Asp Gly Ala Pro Lys Val Tyr Gin Glu Thr Ser Glu Met Arg 

254 115 120 125 

2 55 Ser Ala Pro Gly Gly He Arg Glu Thr Arg Arg Thr Val Arg Asp Ser 

256 130 135 140 

257 Asp Ser Gly Leu Glu Gin Met Ser He Gly His His He Arg Asp Arg 

258 145 150 155 160 

259 Ala His He Leu Gin Arg Ser Arg Asn His Arg Thr Gly Asp Gin Glu 

260 165 170 175 

261 Glu Arg Gin Asp Tyr He Asn Leu Asp Glu Ser Glu Ala Ala Ala Phe 

262 180 185 190 

263 Asp Asp Glu Trp Arg Arg Glu Thr Ser Arg Phe Arg Gin Gin Arg Pro 

264 • 195 200 205 

265 Leu Glu Phe Arg Arg Leu Glu Ser Ser Gly Ala Gly Gly Arg Arg Ala 

266 210 215 , 220 

267 Glu Gly Pro Pro Arg Leu Ala He Gin Gly Pro Glu Asp Ser Pro Ser 

268 225 230 235 240 
26 9 Arg Gin Ser Arg Arg Tyr Asp Trp 

270 245 

272 (2) INFORMATION FOR SEQ ID NO : 6: 

274 (i) SEQUENCE CHARACTERISTICS: 

275 (A) LENGTH: 1502 base pairs 

276 (B) TYPE: nucleic acid 

277 (C) STRANDEDNESS : single 

278 (D) TOPOLOGY: linear 

280 (vii) IMMEDIATE SOURCE: 

281 (A) LIBRARY: GenBank 

282 (B) CLONE: 1399744 

284 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

286 CTCTAAAGGG CAGCTGTGGG AGGAGGCGGC GTGGAAGGCC GAGGAGCTCA AGCCCGGACC 60 

287 AATCCCCACG TTCCGGGCCG CGACCCTGAC CCTGCAGCGT ACCGGGAAGC GAAACCGGCC 12 0 
2 88 GGATGGGCCG CTGAGCCCGA ATCGGGCACT GTGTGGAGCC CCCTGGAGCT GAGATCAGGA 180 
2 89 TGTTCCGCTT CATGAGGGAC GTGGAGCCTG AGGATCCCAT GTTCCTGATG GATCCCTTTG 240 
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L:5 M:220 C: Keyword misspelled or invalid format, [(1) GENERAL INFORMATION:] 

L:10 M:220 C: Keyword misspelled or invalid format, [(ii) TITLE OF INVENTION:] 

L:30 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 

L:31 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:35 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 

L:36 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:77 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 

L:79 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 
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